Optimization Algorithms for Identification and Genotyping of Copy Number Polymorphisms in Human Populations
نویسندگان
چکیده
Recent studies show that copy number polymorphisms (CNPs), defined as genome segments that are polymorphic with regard to genomic copy number and segregate at greater than 1% frequency in the populations, are associated with various diseases. Since rare copy number variations (CNVs) and CNPs bear different characteristics, the problem of discovering CNPs presents opportunities beyond what is available to algorithms that are designed to identify rare CNVs. We present a method for identifying and genotyping common CNPs. The proposed method, POLYGON, produces copy number genotypes of the samples at each CNP and fine-tunes its boundaries by framing CNP identification and genotyping as an optimization problem with an explicitly formulated objective function. We apply POLYGON to data from hundreds of samples and demonstrate that it significantly improves the performance of existing single-sample CNV identification methods. We also demonstrate its superior performance as compared to two other CNP identification/genotyping methods.
منابع مشابه
Genotyping Analysis of rs1799989 Single Nucleotide Polymorphism in TYR Gene Region in the Population of Isfahan, Iran
Background & Aims: Tyrosinase is the most important enzyme in the production of pigments of the skin, eyes, and hair follicles. The enzyme is encoded by tyrosinase gene (TYR) or oculocutaneous albinism type 1A (OCA1A). Mutations in TYR gene result in pigmentation disorders such as albinism in humans. In view of the large number of mutations reported in this gene, the aim of this study was to id...
متن کاملEffect of Combining Multiple CNV Defining Algorithms on the Reliability of CNV Calls from SNP Genotyping Data
In addition to single-nucleotide polymorphisms (SNP), copy number variation (CNV) is a major component of human genetic diversity. Among many whole-genome analysis platforms, SNP arrays have been commonly used for genomewide CNV discovery. Recently, a number of CNV defining algorithms from SNP genotyping data have been developed; however, due to the fundamental limitation of SNP genotyping data...
متن کاملCNAT 4.0: Copy Number and Loss of Heterozygosity Estimation Algorithms for the GeneChip® Human Mapping 10/50/100/250/500K Array Set
Introduction There exists evidence of correlations between carcinogenesis and genetic alterations in tumor cells. These alterations involve allelic imbalances, manifested in the form of chromosomal copy number changes – amplifications, deletions, aneuploidy, loss of heterozygosity, micro-satellite instability among others [1]. These events frequently indicate the activation of oncogenes for exa...
متن کاملIdentification of the Rare, Four Repeat Allele of IL-4 Intron-3 VNTR Polymorphism in Indian Populations
Background: Cytokines are cell signaling molecules which upon release by cells facilitate the recruitment of immune-modulatory cells towards the sites of inflammation. Genetic variations in cytokine genes are shown to regulate their production and affect the risk of infectious as well as autoimmune diseases. Intron-3 of interleukin-4 gene (IL-4) harbors 70-bp variable number of tandem repeats (...
متن کاملA Novel Simple Method for Determining CYP2D6 Gene Copy Number and Identifying Allele(s) with Duplication/Multiplication
BACKGROUND Cytochrome P450 2D6 (CYP2D6) gene duplication and multiplication can result in ultrarapid drug metabolism and therapeutic failure or excessive response in patients. Long range polymerase chain reaction (PCR), restriction fragment length polymorphism (RFLP) and sequencing are usually used for genotyping CYP2D6 duplication/multiplications and identification, but are labor intensive, ti...
متن کامل